PREFIX-PROJECTION Global Constraint for Sequential Pattern Mining

نویسندگان

  • Amina Kemmar
  • Samir Loudni
  • Yahia Lebbah
  • Patrice Boizumault
  • Thierry Charnois
چکیده

Sequential pattern mining under constraints is a challenging data mining task. Many efficient ad hoc methods have been developed for mining sequential patterns, but they are all suffering from a lack of genericity. Recent works have investigated Constraint Programming (CP) methods, but they are not still effective because of their encoding. In this paper, we propose a global constraint based on the projected databases principle which remedies to this drawback. Experiments show that our approach clearly outperforms CP approaches and competes well with ad hoc methods on large datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Global Constraint for Mining Sequential Patterns with GAP Constraint

Sequential pattern mining (SPM) under gap constraint is a challenging task. Many efficient specialized methods have been developed but they are all suffering from a lack of genericity. The Constraint Programming (CP) approaches are not so effective because of the size of their encodings. In [7], we have proposed the global constraint PREFIX-PROJECTION for SPM which remedies to this drawback. Ho...

متن کامل

Mining Constraint-based Multidimensional Frequent Sequential Pattern in Web Logs

In this paper we introduce an efficient strategy for discovering Web usage mining is the application of data mining techniques to discover usage patterns from Web data, in order to understand and better serve the needs of Web-based applications. Web usage mining consists of three phases, namely preprocessing, pattern discovery, and pattern analysis. This paper describes each of these phases in ...

متن کامل

Multi-Level Weighted Sequential Pattern Mining Based on Prime Encoding

Encoding can express the hierarchical relationship in the area of mining the multi-level sequential pattern, up to now all the algorithms of which find frequent sequences just according to frequency, but items have different importance in the real applications, therefore the weight constraint involved to the entire mining process is crucial. The MWSP algorithm based on the candidate generation-...

متن کامل

PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth

Sequential pattern mining is an important data mining problem with broad applications. I t is challenging since one may need to examine a combinatorially explosive number of possible subsequence patterns. Most of the previously developed sequential pattern mining methods follow the methodology of A priori which may substantially reduce the number of combinations to be examined. Howeve6 Apriori ...

متن کامل

A Survey on Algorithms for Sequential Pattern Mining

Sequential pattern mining is a very useful mining technique for various sectors like healthcare, retail business, DNA analysis etc. It generates patterns which are frequently occurring in given sequence of transactions. It uses sequence database having sequence of transactions with transaction time. In sequence database every transaction is having various items. By sequential pattern mining use...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015